Review of Clustering Techniques for Finding the Similarity in Articles

نویسندگان

Usha Rani

Shashank Sahu

Jiawei Han

S. Anitha

J. Akilandeswari

Anoop Kumar

چکیده

Clustering is an important technique in data mining. It is a technique in which grouping of item taken place into the clusters in such a way that items of same cluster have more similarity than the items into another cluster, but is very dissimilar to the item in other clusters. The aim of document clustering is to make a set of clusters of given documents in such a way that document of each cluster have more similarity than the documents of other clusters. This paper reviews various techniques of clustering which can be divided mainly into two groups that are hierarchical and partitional clustering.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...

متن کامل

An Optimization K-Modes Clustering Algorithm with Elephant Herding Optimization Algorithm for Crime Clustering

The detection and prevention of crime, in the past few decades, required several years of research and analysis. However, today, thanks to smart systems based on data mining techniques, it is possible to detect and prevent crime in a considerably less time. Classification and clustering-based smart techniques can classify and cluster the crime-related samples. The most important factor in the c...

متن کامل

داده‌کاوی بالینی: مروری بر تکنیک‌های داده‌کاوی در دیابت

Background: Provide a health care service to the patients with diabetes provides useful information that could be used to identify, treatment, following up and prevention of diabetes. Explore and investigation of large volumes of data requires effective and efficient methods for finding hiding patterns in the data. The use of various techniques of data mining in particular Classification and Fr...

متن کامل

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

ارائه یک الگوریتم خوشه بندی برای داده های دسته ای با ترکیب معیارها

Clustering is one of the main techniques in data mining. Clustering is a process that classifies data set into groups. In clustering, the data in a cluster are the closest to each other and the data in two different clusters have the most difference. Clustering algorithms are divided into two categories according to the type of data: Clustering algorithms for numerical data and clustering algor...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Review of Clustering Techniques for Finding the Similarity in Articles

نویسندگان

چکیده

منابع مشابه

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

An Optimization K-Modes Clustering Algorithm with Elephant Herding Optimization Algorithm for Crime Clustering

داده‌کاوی بالینی: مروری بر تکنیک‌های داده‌کاوی در دیابت

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

ارائه یک الگوریتم خوشه بندی برای داده های دسته ای با ترکیب معیارها

عنوان ژورنال:

اشتراک گذاری